97 research outputs found

    Layer-Wise Relevance Propagation for Explaining Deep Neural Network Decisions in MRI-Based Alzheimer's Disease Classification

    Get PDF
    Deep neural networks have led to state-of-the-art results in many medical imaging tasks including Alzheimer’s disease (AD) detection based on structural magnetic resonance imaging (MRI) data. However, the network decisions are often perceived as being highly non-transparent, making it difficult to apply these algorithms in clinical routine. In this study, we propose using layer-wise relevance propagation (LRP) to visualize convolutional neural network decisions for AD based on MRI data. Similarly to other visualization methods, LRP produces a heatmap in the input space indicating the importance/relevance of each voxel contributing to the final classification outcome. In contrast to susceptibility maps produced by guided backpropagation (“Which change in voxels would change the outcome most?”), the LRP method is able to directly highlight positive contributions to the network classification in the input space. In particular, we show that (1) the LRP method is very specific for individuals (“Why does this person have AD?”) with high inter-patient variability, (2) there is very little relevance for AD in healthy controls and (3) areas that exhibit a lot of relevance correlate well with what is known from literature. To quantify the latter, we compute size-corrected metrics of the summed relevance per brain area, e.g., relevance density or relevance gain. Although these metrics produce very individual “fingerprints” of relevance patterns for AD patients, a lot of importance is put on areas in the temporal lobe including the hippocampus. After discussing several limitations such as sensitivity toward the underlying model and computation parameters, we conclude that LRP might have a high potential to assist clinicians in explaining neural network decisions for diagnosing AD (and potentially other diseases) based on structural MRI data

    COVID-19: a simple statistical model for predicting intensive care unit load in exponential phases of the disease

    Get PDF
    One major bottleneck in the ongoing COVID-19 pandemic is the limited number of critical care beds. Due to the dynamic development of infections and the time lag between when patients are infected and when a proportion of them enters an intensive care unit (ICU), the need for future intensive care can easily be underestimated. To infer future ICU load from reported infections, we suggest a simple statistical model that (1) accounts for time lags and (2) allows for making predictions depending on different future growth of infections. We have evaluated our model for three heavily affected regions in Europe, namely Berlin (Germany), Lombardy (Italy), and Madrid (Spain). Before extensive containment measures made an impact, we first estimate the region-specific model parameters, namely ICU rate, time lag between infection, and ICU admission as well as length of stay in ICU. Whereas for Berlin, an ICU rate of 6%, a time lag of 6 days, and a stay of 12 days in ICU provide the best fit of the data, for Lombardy and Madrid the ICU rate was higher (18% and 15%) and the time lag (0 and 3 days) and the stay in ICU (3 and 8 days) shorter. The region-specific models are then used to predict future ICU load assuming either a continued exponential phase with varying growth rates (0-15%) or linear growth. By keeping the growth rates flexible, this model allows for taking into account the potential effect of diverse containment measures. Thus, the model can help to predict a potential exceedance of ICU capacity depending on future growth. A sensitivity analysis for an extended time period shows that the proposed model is particularly useful for exponential phases of the disease

    Promises and pitfalls of deep neural networks in neuroimaging-based psychiatric research

    Full text link
    By promising more accurate diagnostics and individual treatment recommendations, deep neural networks and in particular convolutional neural networks have advanced to a powerful tool in medical imaging. Here, we first give an introduction into methodological key concepts and resulting methodological promises including representation and transfer learning, as well as modelling domain-specific priors. After reviewing recent applications within neuroimaging-based psychiatric research, such as the diagnosis of psychiatric diseases, delineation of disease subtypes, normative modeling, and the development of neuroimaging biomarkers, we discuss current challenges. This includes for example the difficulty of training models on small, heterogeneous and biased data sets, the lack of validity of clinical labels, algorithmic bias, and the influence of confounding variables

    Harnessing spatial homogeneity of neuroimaging data: patch individual filter layers for CNNs

    Full text link
    Neuroimaging data, e.g. obtained from magnetic resonance imaging (MRI), is comparably homogeneous due to (1) the uniform structure of the brain and (2) additional efforts to spatially normalize the data to a standard template using linear and non-linear transformations. Convolutional neural networks (CNNs), in contrast, have been specifically designed for highly heterogeneous data, such as natural images, by sliding convolutional filters over different positions in an image. Here, we suggest a new CNN architecture that combines the idea of hierarchical abstraction in neural networks with a prior on the spatial homogeneity of neuroimaging data: Whereas early layers are trained globally using standard convolutional layers, we introduce for higher, more abstract layers patch individual filters (PIF). By learning filters in individual image regions (patches) without sharing weights, PIF layers can learn abstract features faster and with fewer samples. We thoroughly evaluated PIF layers for three different tasks and data sets, namely sex classification on UK Biobank data, Alzheimer's disease detection on ADNI data and multiple sclerosis detection on private hospital data. We demonstrate that CNNs using PIF layers result in higher accuracies, especially in low sample size settings, and need fewer training epochs for convergence. To the best of our knowledge, this is the first study which introduces a prior on brain MRI for CNN learning

    Uncovering convolutional neural network decisions for diagnosing multiple sclerosis on conventional MRI using layer-wise relevance propagation

    Get PDF
    Machine learning-based imaging diagnostics has recently reached or even superseded the level of clinical experts in several clinical domains. However, classification decisions of a trained machine learning system are typically non-transparent, a major hindrance for clinical integration, error tracking or knowledge discovery. In this study, we present a transparent deep learning framework relying on convolutional neural networks (CNNs) and layer-wise relevance propagation (LRP) for diagnosing multiple sclerosis (MS). MS is commonly diagnosed utilizing a combination of clinical presentation and conventional magnetic resonance imaging (MRI), specifically the occurrence and presentation of white matter lesions in T2-weighted images. We hypothesized that using LRP in a naive predictive model would enable us to uncover relevant image features that a trained CNN uses for decision-making. Since imaging markers in MS are well-established this would enable us to validate the respective CNN model. First, we pre-trained a CNN on MRI data from the Alzheimer's Disease Neuroimaging Initiative (n = 921), afterwards specializing the CNN to discriminate between MS patients and healthy controls (n = 147). Using LRP, we then produced a heatmap for each subject in the holdout set depicting the voxel-wise relevance for a particular classification decision. The resulting CNN model resulted in a balanced accuracy of 87.04% and an area under the curve of 96.08% in a receiver operating characteristic curve. The subsequent LRP visualization revealed that the CNN model focuses indeed on individual lesions, but also incorporates additional information such as lesion location, non-lesional white matter or gray matter areas such as the thalamus, which are established conventional and advanced MRI markers in MS. We conclude that LRP and the proposed framework have the capability to make diagnostic decisions of..

    Nucleus basalis of Meynert predicts cognition after deep brain stimulation in Parkinson's disease

    Get PDF
    INTRODUCTION Subthalamic DBS in Parkinson's disease has been associated with cognitive decline in few cases. Volume reduction of the nucleus basalis of Meynert (NBM) seems to precede cognitive impairment in Parkinson's disease. In this retrospective study, we evaluated NBM volume as a predictor of cognitive outcome 1 year after subthalamic DBS. METHODS NBM volumes were calculated from preoperative MRIs using voxel-based morphometry. Cognitive outcome was defined as the relative change of MMSE or DemTect scores from pre-to 1 year postoperatively. A multiple linear regression analysis adjusted for the number of cognitive domains affected in the preoperative neuropsychological testing and UPDRS III was conducted. To account for other variables and potential non-linear effects, an additional machine learning analysis using random forests was applied. RESULTS 55 patients with Parkinson's disease (39 male, age 61.4 ± 7.5 years, disease duration 10.8 ± 4.7 years) who received bilateral subthalamic DBS electrodes at our center were included. Although overall cognition did not change significantly, individual change in cognitive abilities was variable. Cognitive outcome could be predicted based on NBM size (B = 208.98, p = 0.022*) in the regression model (F(3,49) = 2.869; R2 of 0.149; p = 0.046*). Using random forests with more variables, cognitive outcome could also be predicted (average root mean squared error between predicted and true cognitive change 11.28 ± 9.51, p = 0.039*). Also in this model, NBM volume was the most predictive variable. CONCLUSION NBM volume can be used as a simple non-invasive predictor for cognitive outcome after DBS in Parkinson's disease, especially when combined with other clinical parameters that are prognostically relevant

    Altered Coupling of Psychological Relaxation and Regional Volume of Brain Reward Areas in Multiple Sclerosis

    Get PDF
    Background:Psychological stress can influence the severity of multiple sclerosis (MS), but little is known about neurobiological factors potentially counteracting these effects. Objective:To identify gray matter (GM) brain regions related to relaxation after stress exposure in persons with MS (PwMS). Methods:36 PwMS and 21 healthy controls (HCs) reported their feeling of relaxation during a mild stress task. These markers were related to regional GM volumes, heart rate, and depressive symptoms. Results:Relaxation was differentially linked to heart rate in both groups (t= 2.20,p= 0.017), i.e., both markers were only related in HCs. Relaxation was positively linked to depressive symptoms across all participants (t= 1.99,p= 0.045) although this link differed weakly between groups (t= 1.62,p= 0.108). Primarily, the volume in medial temporal gyrus was negatively linked to relaxation in PwMS (t= -5.55, p(family-wise-error(FWE)corrected)= 0.018). A group-specific coupling of relaxation and GM volume was found in ventromedial prefrontal cortex (VMPFC) (t= -4.89, p(FWE)= 0.039). Conclusion:PwMS appear unable to integrate peripheral stress signals into their perception of relaxation. Together with the group-specific coupling of relaxation and VMPFC volume, a key area of the brain reward system for valuation of affectively relevant stimuli, this finding suggests a clinically relevant misinterpretation of stress-related affective stimuli in MS

    Large scale multifactorial likelihood quantitative analysis of BRCA1 and BRCA2 variants: An ENIGMA resource to support clinical variant classification

    Get PDF
    The multifactorial likelihood analysis method has demonstrated utility for quantitative assessment of variant pathogenicity for multiple cancer syndrome genes. Independent data types currently incorporated in the model for assessing BRCA1 and BRCA2 variants include clinically calibrated prior probability of pathogenicity based on variant location and bioinformatic prediction of variant effect, co-segregation, family cancer history profile, co-occurrence with a pathogenic variant in the same gene, breast tumor pathology, and case-control information. Research and clinical data for multifactorial likelihood analysis were collated for 1,395 BRCA1/2 predominantly intronic and missense variants, enabling classification based on posterior probability of pathogenicity for 734 variants: 447 variants were classified as (likely) benign, and 94 as (likely) pathogenic; and 248 classifications were new or considerably altered relative to ClinVar submissions. Classifications were compared with information not yet included in the likelihood model, and evidence strengths aligned to those recommended for ACMG/AMP classification codes. Altered mRNA splicing or function relative to known nonpathogenic variant controls were moderately to strongly predictive of variant pathogenicity. Variant absence in population datasets provided supporting evidence for variant pathogenicity. These findings have direct relevance for BRCA1 and BRCA2 variant evaluation, and justify the need for gene-specific calibration of evidence types used for variant classification
    • 

    corecore